Vowel landmark detection

نویسنده

  • Andrew Wilson Howitt
چکیده

Landmark based speech processing is a component of Lexical Access From Features (LAFF), a novel paradigm for feature based speech recognition. Detection and classi cation of landmarks is a crucial rst step in a LAFF system. This work tests the theoretical characteristics of vowels, and shows results for work in progress on a Vowel Landmark Detector. Acoustic theory predicts rst formant peaks in vowels, both in frequency and amplitude (at least for vowels between orally closed consonants). Formant tracking measurements found peaks in about 94% of vowels in the TIMIT database. Vowels which do not show a peak generally do not obey the theoretical assumptions, or are liable to formant tracker error due to nasalization, glottalization, or aspiration. Amplitude peaks are more reliable than frequency peaks. Peaks tend to occur early in the vowel, and frequency peaks tend to occur slightly before amplitude peaks. A xed spectral band gave performance comparable to the formant tracker for this task, allowing a simpler detection algorithm. Previous work on a Vowel Landmark Detector is extended by use of a multilayer perceptron (MLP) to combine knowledge-based acoustic cues. The MLP decreases error rate to about 12%, of which about 8% are deletions. Since about 6% of vowels had no detectable peak, this performance is close to the expected limit of a peak picking algorithm. Work is continuing on algorithm improvements, including the output of con dence scores.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mandarin Chinese tone nucleus detection with landmarks

This paper discusses a new approach to improve tone recognition by modeling the tone nucleus with vowel landmark detection. The tone nucleus region is identified based on vowel landmark frames derived by an automatic landmark recognition system. In the corresponding tone recognition experiments, the best results with landmark-based tone nucleus regions outperform the best baseline system result...

متن کامل

Automatic Syllable Detection for Vowel Landmarks

Lexical Access From Features (LAFF) is a proposed knowledge-based speech recognition system which uses landmarks to guide the search for distinctive features. The first stage in LAFF must find Vowel landmarks. This task is similar to automatic detection of syllable nuclei (ASD). This thesis adapts and extends ASD algorithms for Vowel landmark detection. In addition to existing work on ASD, the ...

متن کامل

Robust Vowel Landmark Detection Using Epoch-Based Features

Automatic detection of vowel landmarks is useful in many applications such as automatic speech recognition (ASR), audio search, syllabification of speech and expressive speech processing. In this paper, acoustic features extracted around epochs are proposed for detection of vowel landmarks in continuous speech. These features are based on zero frequency filtering (ZFF) and single frequency filt...

متن کامل

Nasal detection module for a knowledge-based speech recognition system

The Lexical Access From Features (LAFF) project tries to model the representation and perception of speech by human listeners. The derivation of such a representation involves first finding certain acoustic landmarks. Based on the landmarks and the acoustic cues surrounding the landmarks, distinctive features of the speech segments may be deciphered. The present study concentrates on the nasali...

متن کامل

A probabilistic framework for landmark detection based on phonetic features for automatic speech recognition.

A probabilistic framework for a landmark-based approach to speech recognition is presented for obtaining multiple landmark sequences in continuous speech. The landmark detection module uses as input acoustic parameters (APs) that capture the acoustic correlates of some of the manner-based phonetic features. The landmarks include stop bursts, vowel onsets, syllabic peaks and dips, fricative onse...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999